Some observations on computer lip-reading: moving from the dream to the reality
Abstract
In the quest for greater computer lip-reading performance there are a number of tacit assumptions which are either present in the datasets (high resolution for example) or in the methods (recognition of spoken visual units called “visemes” for example). Here we review these and other assumptions and show the surprising result that computer lip-reading is not heavily constrained by video resolution, pose, lighting and other practical factors. However, the working assumption that visemes, which are the visual equivalent of phonemes, are the best unit for recognition does need further examination. We conclude that visemes, which were defined over a century ago, are unlikely to be optimal for a modern computer lip-reading system.
Similar resources
Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods
For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among this, lip-reading techniques encountered with many challenges for speech recognition, that one of the challenges b...
Reading English in the Computer Lab
The present study compares the performance of two TEFL reading classes: one taking place in a regular classroom and the other held in a computer lab, with the learners practicing reading online. The results of an independent samples t-test showed that the difference between the learners’ scores on their reading comprehension post-tests and pretests did not differ statistically significantly fro...
The world as a Dream, a Comparative Study in Ibn Arabi’s Thought and the Film Inception
Among movies that considered «dream», "Inception" got a notable fortune. The film, written and directed by Christopher Nolan, has been the source of many academic discussions around the world as well as having won many credible film awards. On the other side, Ibn Arabi, who has referred to him as the designer and theorist of theoretical mysticism, systematically has explained this subject based...
Lip-reading: A new authentication method in Android mobile phone applications
Today, mobile phones are one of the first instruments every individual person interacts with. There are lots of mobile applications used by people to achieve their goals. One of the most-used applications is mobile banks. Security in m-bank applications is very important, therefore modern methods of authentication is required. Most of m-bank applications use text passwords which can be stolen b...
Lip-reading and speech perception of hearing-impaired students in special schools for the hearing impaired in Tehran
Objective: The goal of this study was to evaluate the lip reading ability and Speech perception of hearing impaired students of special schools for the hearing impaired in different speech levels. Materials & Methods: In this cross- sectional study, 44 deaf students (9-12 years old) were selected with multi-stage cluster sampling method, from two special schools for the deaf in Tehran. Tools...
Journal: CoRR
Volume: abs/1710.01084
Publication date: 2017